Decision making with inference and learning methods
نویسندگان
چکیده
In this work we consider probabilistic approaches to sequential decision making. The ultimate goal is to provide methods by which decision making problems can be attacked by approaches and algorithms originally built for probabilistic inference. This in turn allows us to directly apply a wide variety of popular, practical algorithms to these tasks. In Chapter 1 we provide an overview of the general problem of sequential decision making and a broad description of various solution methods. Much of the remaining work of this thesis then proceeds by relying upon probabilistic reinterpretations of the decision making process. This strategy of reducing learning problems to simpler inference tasks has been shown to be very fruitful in much of machine learning, and we expect similar improvements to arise in the control and reinforcement learning fields. The approaches of Chapters 2–3 build upon the framework of [Toussaint and Storkey, 2006] in reformulating the solution of Markov decision processes instead as maximum-likelihood estimation in an equivalent probabilistic model. In Chapter 2 we utilize this framework to construct an Expectation Maximization algorithm for continuous, linear-Gaussian models with mixture-of-Gaussian rewards. This approach extends popular linearquadratic reward models to a much more general setting. We also show how to extend this probabilistic framework to continuous time processes. Chapter 3 further builds upon these methods to introduce a Bayesian approach to policy search using Markov chain Monte Carlo. In Chapter 4 we depart from the setting of direct policy search and instead consider value function estimation. In particular we utilize leastsquares temporal difference learn-
منابع مشابه
An Introduction to Inference and Learning in Bayesian Networks
Bayesian networks (BNs) are modern tools for modeling phenomena in dynamic and static systems and are used in different subjects such as disease diagnosis, weather forecasting, decision making and clustering. A BN is a graphical-probabilistic model which represents causal relations among random variables and consists of a directed acyclic graph and a set of conditional probabilities. Structure...
متن کاملEvaluation of the Relationship between Assertiveness, Decision Making Styles and Organizational Learning of Health Managers in Shahrekord University of Medical Sciences in 2018
Background & Aim: Managers are always learning to make critical decisions, and learning organizations emphasize continuous learning to survive in the current competitive environment. This study aimed to evaluate the relationship between assertiveness and decision making styles and organizational learning of health managers of Shahrekord University of Medical Sciences. Materials and Methods: Thi...
متن کاملتأثیر یادگیری مبتنی بر الگوریتم بر تصمیمگیری بالینی دانشجویان فوریتهای پزشکی
Introduction: Improvement of students’ clinical decision making is one of the main challenges in medical education. There are numerous ways to improve these skills. The aim of this study was to examine the effect of algorithm-based learning on clinical decision making abilities of medical emergency students. Method: in this experimental study, twenty five medical emergency students were rand...
متن کاملEffectiveness of the Self-determination Educational Package on Self-directed Learning and Decision-making Styles among High School Students
Introduction: The purpose of this study was to develop a self-determination educational package and determine its effectiveness on Self-Directed Learning and Decision making Styles of high school students. Methods: The research method was semi-experimental with pre-test, post-test with the control group and follow up. At first, self-determination educational package was compiled using library s...
متن کاملInfluencing factors on integrating professional learning of secondary mathematics teachers with the analysis, interpretation and decision-making of their teaching
The present study is part of a bigger research project and its purpose was to investigate the influencing factors on integrating professional learning of secondary mathematics teachers with the ways in which they analyze, interpret and make decisions regarding their teaching. For the fine-grained analysis of the first layer of data, phenomenography method was used. Nine teachers were interviewe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013